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NOTES 



ON A NEW METHOD OF COMPUTING FREQUENCY 
PERCENTAGES 

By Robert E. Mohitz, University of Washington 



Suppose there be given the frequencies 

fit fit fh • • ■ t frit 

whose sum is 

F**fi+f,+f*+ ■ • ■ +fn, 
and it is required to compute the frequency percentages 
Pk=100f k /F, fc = l, 2, 3, . . .,n. 

The method commonly employed is to compute the ratios f k /F by 
means of a slide-rule, or, if a slide-rule is not available, to compute the 
ratio 100/F and then multiply, logarithmically or otherwise, each of 
the numbers fk by this common factor. 

Some time ago, while spending a vacation period at an island camp, 
the writer unexpectedly interested himself in a statistical inquiry which 
involved the computation of several hundred tables of frequency 
percentages. Being deprived of the usual aids in computation, he hit 
upon the following method of approximation, which, as far as sim- 
plicity and rapidity is concerned, makes the usual aids almost, if not 
entirely, superfluous. Moreover, the writer is now able to show that 
the results obtained by this method are sufficiently accurate for all 
practical purposes — the method admitting, in fact, of extension so as 
to secure any desired degree of accuracy. 

The method referred to consists of two steps. 

(1) The frequencies /*. are replaced by proportional numbers f' k 
whose sum F' approximates 100. 

(2) The difference D = 100 — F' is distributed over the numbers f\ 
in accordance with the formula Af' k =f' k • D/100, where A/' k is that 
portion of the difference D which falls to f' k . 

It is obvious that when F' = 100, there remains no difference to be 
distributed, and the numbers f' k are the true frequency percentages. 
If, however, F' differs from 100, as is usually the case, the numbers f' k , 
augmented as directed in (2), will differ slightly from the true fre- 
quency percentages, their true values being f' k +f' k • D/F'., while our 
formula gives f' k +f'k ■ D/100. 
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We shall illustrate the method by some examples before we con- 
sider the magnitude of the errors involved. 

In the tables that follow, f k represents the given frequencies, /' *. the 
derived frequencies obtained as directed in (1), Af' k =f' k • D/100, 
the portions of the difference D = 100— F' to be added to /'*, and 
p'k=fk+&f'k> the approximate frequency percentages. 



TABLE I 



k 


f'k 


f'k 


Mh 


*>'* 


1 


4 


6. 


+0.06 


6.06 


2 


15 


22.5 


0.225 


22.725 


3 


23 


34.5 


0.345 


34.845 


4 


17 


25.5 


0.255 


25.755 


5 


7 


10.5 


0.105 


10.605 



i?=66 i?'=99.0 
(1) /'i=/i+$4- (2)100-F'=1. A/'j.=l% of f' k . 

TABLE II 



99.99 (check) 



k 


/* 


/'* 


A/-* 


P'k 


1 


5 


6. 


—0.05 


5.95 


2 


10 


12. 


0.10 


11.90 


3 


13 


15.6 


0.13 


15.47 


4 


24 


28.8 


0.23 


28.57 


5 


16 


19.2 


0.15 


19.05 


6 


9 


10.8 


0.09 


10.71 


7 


7 


8.4 


0.07 


8.33 



F=84 F'=100.8 



99.98 (check) 



The results have been rounded off to two decimal places. 



(1) f'k=fk+% 10O-F'=-0.8. Af k —£%otf k . 



TABLE III 



k 


h 


>'* 


Af' k 


P't 


1 


9 


5.4 


—0.14 


5.26 


2 


24 


14.4 


0.36 


14.04 


3 


30 


18.0 


0.45 


17.55 


4 


40 


24.0 


0.60 


23.40 


5 


31 


18.6 


0.46 


18.14 


6 


27 


16.2 


0.40 


15.80 


7 


10 


6.0 


0.15 


5.85 



F=171 F'=102.6 



100.04 (check) 



(1) /'j.-— A. (2) 100-*"— 2.6, Af' k =-- of 10% of f' k (nearly). 
"10 4 
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TABLE IV 



k 


h 


f'k 


A/'i 


V'k 


1 


18 


0.72 


+0.005 


0.725 


2 


42 


1.68 


0.012 


1.692 


3 


85 


3.40 


0.024 


3.424 


4 


101 


4.04 


0.028 


4.068 


5 


150 


6.00 


0.041 


6.041 


6 


202 


8.08 


0.055 


8.135 


7 


300 


12.00 


0.082 


12.082 


8 


450 


18.00 


0.122 


18.122 


9 


373 


14.92 


0.101 


15.021 


10 


252 


10.08 


0.069 


10.149 


11 


178 


7.12 


0.048 


7.168 


12 


120 


4.80 


0.033 


4.833 


13 


90 


3.60 


0.025 


3.625 


14 


59 


2.36 


0.016 


2.376 


IS 


50 


2.00 


0.014 


2.014 


16 


13 


0.52 


0.004 


0.524 
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F=2483 F'=99.32 



99.999 (cheek) 



(1) A«— /*■ (2) 100-F'=0. 68=0. 7 nearly. A/V= — of 1% of f t (neatly). 
* 100 "10 * 



The values of p\ are correct to three decimal places For many pur- 
poses the values /'& give sufficiently close approximations of the 
values Pfc. 

When the method is once clearly understood, the corrections Af' k 
need not be tabulated but may be immediately applied to the /Vs 
and the results entered in the column p'*. 

Let us now consider the magnitude of the errors introduced into 
the values of p'k because of (2). Let p* represent the true values of 
the frequency percentages, and let 

Pk=f'k+c-f' k , 

where c is the fraction of f' k that must be added to /'* to obtain j> k . 
On summing all the values of p k we have 

2p*=Z(/'*+c • f k ) =S/' ft +c • 2/'*, 

from which, since Sp A =100, and 2/'*=.?", 

100 =F'+c-F' 

that is 

c= (100 -F')/F'. 

The true frequency percentages are therefore given by the formula 
Pk=f'k+f'k(100-F')/F', 

while 

pW*+f*(100-F')/100. 
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We may now calculate the error in p' k , namely, 

ft-pW»(100-y)^ - ~) =f k m ~ F,)i ; 
vu Jkv j\ F , 1QO j jk 10QF , , 

also, the sum of all the errors 

F 100F' Jh 100 ' 

and the relative error 

100-Zp' fc _ri00-F'l 2 



R- 



100 



i'tT lOO-FV 
L 100 J 



The last formula shows that in the case of the four examples given, 
the relative errors do not exceed 0.0001, 0.00007, 0.0007, and 0.00005 
respectively. F' may therefore vary from 100 within much wider 
limits than in the case of the examples cited and yet yield values of p k 
sufficiently accurate for all practical purposes. If F' is anywhere 
between 95 and 105, the relative error will be less than J of 1 per cent. 
Moreover, it is easy to determine beforehand the limits within which 
F' must be chosen so as to give any preassigned accuracy to the values 
of pk. For example, if the relative error is to be less than T V of 1 per 

cent, we have I" 100 ^ ~| 2 <0 001 from ^fa 96.9 <i<"< 103.2. 
L 100 J 



STANDARDIZATION OF STATISTICS IN STATE 

INSTITUTIONS* 

By Horatio M. Pollock, New York State Hospital Commission 



The past half century has witnessed an enormous expansion of state 
institutions for the care of dependents, defectives, and delinquents. 
Prior to the Civil War there were in the United States but 24 state pris- 
ons or reformatories, 29 state asylums for the insane, and 44 state in- 
stitutions for other classes. The last were principally schools for the 
blind and the deaf. The humanitarian movement inaugurated in the 
early forties by Dorothea L. Dix received new impetus upon the recov- 
ery of the states from the ravages of the war, and new state institutions 
rapidly sprang into being. State boards for the supervision of the 
several classes of dependents were created, and the policy of state care 
gradually became established. There are now about 600 state institu- 
tions in the United States, housing over 400,000 patients, inmates, or 
pupils, and representing an investment of over $400,000,000. The 

*Read at the Eighty-second Annual Meeting of the American Statistical Association, Atlantic 
City, New Jersey, December, 1920. 



